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Abstract — In this work, the problem of cross-tier interference 
in a two-tiered (macro-cell and cognitive small-cells) network, 
under the complete spectrum sharing paradigm, is studied. A 
new orthogonal precoder transmit scheme for the small base 
stations, called multi-user Vandermonde-subspace frequency di- 
vision multiplexing (MU-VFDM), is proposed. MU-VFDM allows 
several cognitive small base stations to coexist with legacy 
macro-cell receivers, by nulling the small- to macro- 
cell cross-tier interference, without any cooperation be- 
tween the two tiers. This cleverly designed cascaded 
precoder structure, not only cancels the cross-tier in- 
terference, but avoids the co-tier interference for the 
small-cell network. The achievable sum-rate of the small-cell 
network, satisfying the interference cancelation requirements, 
is evaluated for perfect and imperfect channel state informa- 
tion at the transmitter. Simulation results for the cascaded 
MU-VFDM precoder show a comparable performance to that of 
state-of-the-art dirty paper coding technique, for the case 
of a dense cellular layout. Finally, a comparison between 
MU-VFDM and a standard complete spectrum separation 
strategy is proposed. Promising gains in terms of achievable 
sum-rate are shown for the two-tiered network w.r.t. the tra- 
ditional bandwidth management approach. 

Index Terms — Interference cancellation, cognitive radio, spec- 
trum sharing, small-cells, linear precoding 

I. Introduction 

Recent academic and industry trends point towards a 
paradigm shift in wireless communications: the adoption of 
a two-tiered network structure (e.g., JT]). Two-tiered networks 
aim at breaking away from the traditional cellular layout to 
provide the expected capacity increase for future wireless 
services. As the name implies, its main difference from the 
traditional cellular paradigm, is the deployment of a second tier 
of densely populated and self-organizing small base stations 
(SBS) 121. In spite of all these qualities, the addition of an 
SBS layer requires coexistence with the existing macro base 
station (MBS) infrastructure. 

Traditionally, coexistence in two-tiered networks is ac- 
complished by means of three different approaches |]3]- In 
complete separation, the MBSs and SBSs operate on disjoint 
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bands, which avoids the mutual interference between the tiers, 
i.e., cross-tier interference, but decreases the spectral effi- 
ciency. To enhance spectral efficiency, the two tiers can share 
part of the total available band, under the partial sharing 
paradigm. In order to work, solutions for cross-tier interference 
control need to be adopted for the shared band. For a maximal 
spectral re-use, the most attractive solution is the complete 
sharing, where the MBSs and SBSs share all the band. Despite 
its notable features, this approach leads to an unbearable 
cross-tier interference, thus requiring interference management 
techniques for the best coexistence of the two network tiers. 

Out of the many techniques that exist for coexistence, a 
popular one named interference alignment (IA) J4), copes with 
cross-tier interference by isolating the received and interfer- 
ence signal subspaces. This requires a smart coordination of 
the devices in the network and special decoding at the receiver 
to realize the alignment. Interestingly, IA-based solutions for 
different channel state information (CSI) assumptions have 
been proposed, requiring the existence of exploitable degrees 
of freedom in the spatial [5 j, frequency |6 | or time ||7] domain. 
Coordinated beamforming J8] based solutions require CSI at 
the transmitter only, with the advantage of no special decoding 
at the receiver. On the other hand, the power normalization 
needed at the transmitter to fulfill the transmit power con- 
straints may result in performance penalties, depending on 
the condition number of the resulting channel matrices, as 
the number of involved MBSs/SBSs grows. Alternatively, in 
the absence of cooperation between the tiers, interference can 
be managed through dynamic spectrum access (DSA) J9). 
DSA strategies such as spectrum shaping iflOl and cooperative 
frequency reuse ifTTI can be adopted at the SBSs, depending 
on the spectrum management approach adopted by the MBS. 

Proposed for a similar problem, cognitive radios (CR) ifTZl 
aim at fostering spectrum re-use by protecting a primary 
(legacy) system from the interference generated by a sec- 
ondary (opportunistic) one. By labeling the MBSs as the pri- 
mary system (first tier) and the SBSs as the secondary system 
(second tier), CR networks can be also seen as a particular case 
of two-tiered networks under the complete sharing approach, 
in which interference protection is inherently asymmetric 
(from the second to the first tier only). An application of IA to 
CR networks has been proposed in contributions such as iTPJl . 
where an SBS is aware of the power allocation performed 
in a multi-antenna MBS and aligns the cross-tier interference 
towards the macro-cell receivers. CR beamforming approaches 
have also been proposed in 04), 03], where multiple spatial 
dimensions at a cognitive transmitter are used to mitigate the 
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cross-tier interference. This is accomplished by satisfying a 
signal to interference plus noise ratio (SINR) constraint at the 
primary receivers, while at the same time serving a reasonable 
rate to one or more secondary receivers. 

In this contribution, we specifically target a two-tiered 
system comprised of a long term evolution (LTE) |[T6l 
orthogonal frequency division multiple access (OFDMA) 
MBS and several cognitive SBSs operating under the 
complete sharing approach. The SBS system is modeled 
as a coordinated network multiple input multiple output 
(MIMO) system with infinite backhaul capacity. We propose 
a novel DSA CR technique for this scenario, called multi-user 
Vandermonde-subspace frequency division multiplexing 
(MU-VFDM). MU-VFDM consists of a cascaded linear 
precoder made up by an inner component designed to 
cancel the cross-tier interference from the SBSs to the 
first tier, and an outer component to avoid the multi-user 
interference in the second tier, i.e., co-tier interference. We 
show that, not only OFDMA, but any block transmission 
scheme that deals with multipath interference, provides 
resources that can be exploited by MU-VFDM to cancel 
the cross-tier interference. Under this assumption, the sole 
requirement is perfect CSI at the transmitter (CSIT), used to 
derive the precoder. This contrasts with the aforementioned 
state-of-the-art techniques, that either require available time, 
space or frequency resources, or cooperation between the tiers 
to be performed. Sum-rate enhancements are shown to be 
achievable w.r.t. to the legacy complete separation approach, 
for a large range of signal to noise ratio (SNR) values, 
regardless of the number of SBSs. Additionally, the impact 
of imperfect CSI at the transmitter is evaluated, providing 
important design insights. 

This paper is organized as follows. In Sec. [II] we introduce 
the general MBS/SBS model assumed throughout this paper. 
Then, we derive the precoders and briefly discuss their per- 
formance in Sec. [TTTI In Sec. IIV1 we present some numerical 
results for our MBS/SBSs study case. Finally, conclusions and 
future research directions are discussed in Sec. [V] 

In this work, we adopt the mathematical notation as de- 
scribed in the following. A lower case italic symbol (e.g. b) 
denotes a scalar value, a lower case bold symbol (e.g. b) 
denotes a vector, an upper case bold symbol (e.g. B) denotes 
a matrix. [B] m n denotes a matrix element at the m th row and 
the n th column. An Ijy denotes the identity matrix of size N . 
The transpose conjugate operator on a matrix is denoted by 
the H superscript (e.g. B H ), the transpose operator is denoted 
by the T (e.g. B T ), the Moore-Penrose pseudoinverse matrix 
is denoted by f (e.g. B^), ker (B) denotes the kernel of the 
matrix B and tr(B) its trace. The operator A <g) B is used 
to represent the Kronecker product. The special matrix On,m 
denotes the zero matrix of dimension N x M. All vectors are 
columns, unless otherwise stated. 

II. System Model 

Consider the downlink scenario in Fig. Q] where all com- 
munications are assumed to be in time division duplex (TDD) 
mode. An MBS and K cognitive SBSs, transmitting over the 
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Figure 1. MU-VFDM downlink model, two-tiered network. 



same frequency band, are deployed in a given area. The MBS 
serves M single-antenna macro-cell user equipments (MUEs). 
The SBSs are considered to cooperate, yielding a full network 
MIMO transmission system model IfTTI . For simplicity, and 
without loss of generality, we assume that each SBS serves one 
single-antenna small-cell user equipment (SUE). Concerning 
the notation, subscript "m" refers to the MBS, while "s" 
refers to the SBSs, i.e., hjm (or Hsm"^) represents a link 
from SBS % to MUE j. Conversely, (or H^ ] ' j) ) denotes 
a vector/matrix related to the transmission from any SBS 
except i. All channel vectors h £ CAf(0, Il + i/(L + 1)), 
irrespective of the tier, transmitter and receiver, represent the 
impulse response of independent and identically distributed 
(i.i.d.) frequency-selective Rayleigh fading channels composed 
of L + 1 paths. 

The MBS adopts an M-user OFDMA based transmission of 
block size N + L and a cyclic prefix of size L. For simplicity, 
a uniform resource allocation of N/M subcarriers per MUE 
is adopted, Afj being the set of subcarrier indices assigned to 

M M 

the j th MUE with [j Afj = {I, . . . , N} and f] Afj = 0. As 

a consequence, each MUE selects its own set of subcarriers 
by means of an N x N mask receiver filter Bj, such that 



tr(B,) 



IJV, 



with fB 



1 



M 

N/M and ^B 

i=o 

when the subcarrier n is allocated to the j th MUE and zero 
otherwise. Let F £ C" x,v be a unitary discrete Fourier 
transform (DFT) matrix with [F]( fe+1 ; +1 ) = -j=e~ l2 '*~x for 
k, I = {0, . . . , N - 1} and A a (N + L) x N cyclic prefix 
insertion matrix given by 
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The channel matrix representing the link from the MBS to the 
j th MUE, after the cyclic prefix removal operation, is defined 
as T(h{ n 1 m j) ) £ C Nx( - N+L \ where T(-) is a Toeplitz operator 
that returns a Toeplitz matrix built from a given vector, i.e., 
for h= [h(0) ■ ■ ■ h{L)\. 
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Concerning the second tier, the SBSs adopt a block transmis- 
sion scheme that will be detailed in Sec. IIII-AI We assume 
that an SUE is not different from an MUE with respect to the 
reception chains, being distinguished merely by the association 
point (MBS or SBS). Therefore, like the MUEs, the SUEs 
discard the leading L symbols and perform a DFT at the 
reception. Naturally, no mask filter is needed at k th SUE, 
given that, in general, no OFDMA-based transmission can 
be performed in the second tier without generating cross-tier 



interference towards the MUEs. Let 7~(him ) G 
be the matrix representing the channel from the i th SBS to the 
j th MUE, constructed from the hsm^ channel coefficients. The 
matrices T(h^ fc) ), T(h s ( P } ) e C Nx( - N+L \ representing the 
link from MBS and the i th SBS to the k th SUE respectively, 
can be similarly constructed. 

Now, let ym , ys be the received iV-sized vector at the 
j th MUE and k th SUE, respectively, s m be the MBS input 
vector of size N, composed of M individual zero mean, unit 



^Nx(N+L) 



norm symbol vectors s„ , j <G [1, M], and Xs* J be the transmit 
vector at the i th SBS, of size N + L, detailed later for clarity. 
Then, if we let n s (fe) - CAf(0,a 2 I N ) be two additive 
white Gaussian noise (AWGN) vectors, the received signals at 
the j th MUE and k th SUE can be expressed as 



yg =B j F(T(h&p)AF- 1 8 n 



A" 



+ £T(h^' ) )xW+n 

i=i 

y« =F(V(h<p))x« + 



(.;) 



(3) 



+ J2 T(h£'«)x« + r(h^ fc ))AF- 1 s m + np>) . 

(4) 

Note that, in |@), we represented yi^ by separating the 
useful signal received from the i th SBS from the co-tier 
interference component generated by the remaining K — 1 
SBSs, operating in the second tier. For yi J ', the all of the 
second tier transmitted signal is seen as interference. 

To simplify the subsequent analysis, consider an equivalent 
aggregate model that includes all users in the system. Let 
us start by looking at the first tier. By summing up all 
the contributions of the MUEs, orthogonal in the frequency 
domain, the equivalent channel matrix from the MBS to the 
MUEs is 

M 



H ffim = ^B i FT(h^)AF- 1 g 

3=1 

Let us now define 



-NxN 



(5) 



M 

H&° = E B^Hhs^) G C N ^ N+L \ (6) 

3=1 

then the equivalent aggregated interference channel from the 
SBSs to the MUEs is constructed as 



H sn 
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sm , • ■ • , AA sm 



Jj(-KV) g t£NxK(N+L) 



(7) 



Switching our focus to the second tier, let H s l s 
Then, by defining 
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the equivalent aggregated channel from the SBSs to the SUEs 
can be written as 



H ss = (I K ® F)H SS e 



^KNxK(N+L) 



(9) 



The interfering link from the MBS to the SUEs is 
obtained by following the same approach. Let 
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(i,fc) 



T(h^ fc) )AF _1 G 
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tt(1.1) 
■Tims 

H (l,2) 
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By defining 



-<KNxN 



(10) 



we can write the equivalent aggregated channel as 



(I 



K 



F)H ms e 



-KNxN 



(ID 



Now, we define y m = y^' as the aggregated received 

3=1 

vector at the MUEs of size N, and y s = [yP , . . . , ys ] T 
as the aggregated received vector at the SUEs of size KN. We 
also define x s = [xj 1 " 11 , . . . , xi K ^ J ] T as the aggregated transmit 
vector at the SBSs, of size K(N + L). The equivalent signal 
model is then obtained as 



ym Hmm^m Hsm^s ^ra 

y s = H ss x s + H ms s m + (I K 



F)n s . 



(12) 
(13) 



Note that, in O and O, n m = ^B^Fn^ and 



.(1)T 



,(if)T,T 



3=1 



are the aggregated AWGN vectors 
of the first and second tier, of size N and KN respectively. 

III. Precoder Design 

According to the cognitive overlay paradigm JT8], the 
secondary system must protect the primary network from 
the interference caused by the opportunistic transmission. By 
looking at (fT2] i. we see that this implies 



H sm x s = 0. 



(14) 



The transmitted message by the MBS to the MUEs is not 
known in the secondary system, which disqualifies algorithms 
like dirty paper coding (DPC) |[T9l . Furthermore, the SBSs 
possess no information about unused resources (time, space or 
frequency) at the primary system and each MUE is a single 
antenna device. Therefore, traditional techniques to design an 
interference-free transmission H- lfPTl can not be implemented 
in the considered scenario. 
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Let Ss be the input symbol vector at the i th SBS, detailed 

,s^ )T l T be 



later for clarity. Consequently, let s s = [si 
the aggregated SBSs' input symbol vector, such that 

x s = Es s 



(15) 



becomes its precoded version through a linear precoder E, 
whose design is discussed in the following. Then ( TT4T > can be 
rewritten as 

H sm E = 0. (16) 

If we assume that each SBS may independently precode its 
input vector to cancel the interference towards the MUEs, we 
can express E as the direct sum 11201 of K precoders 



K 



E = 0Ei, 



(17) 



where Ej is the precoder at the i th SBS. It is straightforward 
to see that when the following holds 

H&>Ei = 0, Vie [1, if], (18) 

( [Tol l is always satisfied, if perfect knowledge of Hsm is 
available at the i th SBSs. Thus, the SBSs do not need to 
share any information related to the cross-tier interference 
channels towards the MUEs to create E. This results in a 
simpler architecture as well as in a lower backhaul signaling. 
As a consequence, we can focus on the i th SBS to devise 
Ej and then apply (JT7J to find the desired overall precoder. 
Moreover, we note that a CSI measurement is valid only 
throughout the coherence time of the channel of interest, 
e.g., Hs,*,. Therefore, we must seek for one-shot strategies 
that do not require iterative procedures between the SBSs and 
the SUEs/MUEs to derive the precoding/decoding matrices, 
such as the IA-based solutions in 1211 and references therein. 
At this stage, we assume perfect CSIT related to the interfering 
links from the SBSs towards the MUEs. In the second part of 
the work, the impact of imperfect CSIT will be analyzed. 

A. Single SBS/SUE Precoder Design 

We first focus on the pair given by the i th SBS and its 
SUE k, thus a scenario as in Fig. [2] i.e., K = 1. In l22l . 
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Figure 2. OFDMA downlink interference channel model, single SBS. 

we proposed a linear precoder called VFDM to solve the 
interference cancelation problem in a single primary, single 



secondary scenario. This kind of situation meant that the 
overall channel matrix was of Toeplitz structure T(h) with 
h = [h\,...,hi], allowing a linear precoder based on a 
Vandermonde matrix ll23l to be constructed from the roots 
of the interfering channel's h polynomial given by 



S(z) 



i=0 



(19) 



Unfortunately, unlike in [22), herein the considered SBS deals 
with a multi-user OFDMA downlink. Due to the multiple 
interfering links from the SBS to the MUEs, no polynomial 
representation of the equivalent channel is possible, and the 
Vandermonde-subspace based result is not directly applicable 
to our case. Nevertheless, the null-space precoder idea can still 
be used, as shown in the following. 

By looking at ( |18t . we note that, if this precoder exists, 
then it must lie within the kernel of Hsm ■ In the considered 
scenario, the redundancy introduced at the MBS, to combat 



the multipath interference, ensures that rank (H 



(vh 



N, 



thus dim (ker (Hs m - 1 ) ) = L. Therefore, the non-emptiness of 
the kernel is guaranteed by the block transmission structure 
adopted in the first tier, i.e., OFDMA, and a solution to 
CEH} can be found. Now, let H^' } = L^Q^ be the LQ 
decomposition of the equivalent channel matrix representing 
the interfering link between the i th SBS and the MUEs, 
where l4m G i£ Nx i N + L ) is a lower triangular matrix and 
Qsm ) G C (Ar+L)x(Ar+L) is a unitary matrix given by 



= [ qi I q 2 



(20) 



By construction, we know that the last L orthonormal columns 
of Qsm lie within kcr(H S m )■ Therefore, if we define 

Ej ^ [ qjv+i | ••• | q (J v+L)-i I qjv+L ] G C^+ L ^ L , 

(21) 

we have an orthogonal precoder that fulfills (1181 . If we 
substitute (1211 into ( fTTI i. we see that the precoder E is obtained 
as a K(N + L) x KL matrix, whose dimension determines 
the size of the previously defined aggregated zero mean, unit 
norm SBSs' input symbol vector s s , i.e., KL. 

We first focus on the macro-cell. If we plug (fT5t into (fT2l i. 
then we obtain 

y m = HmmSm + I'm, (22) 

realizing the desired cross-tier interference cancelation. Note 
that, in (EJ, u m e CAf(0,a*I N ) is the DFT of the AWGN 
vector n m , having the same size and statistic. 

Concerning the received signal at the considered SUE, we 
can rewrite (|4) as 



y (k) = FH (p)E iS » + FH^'sm + i/< 



(23) 



where the co-tier interference component is absent, being the 
focus of the section on a single SBS/SUE pair. In (l23l . Ej is 
a linear precoder as defined in (|2H . H^'s,,, is the cross-tier 
interference coming from the MBS and <E CAf(0, c^I/v) 
is the DFT of ni k) . 

(i) 

At this stage, the dimension of s s , zero mean, unit norm 
input symbol vector at the i th SBS, is clear. In particular, the 
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size of s s , i.e., KL, implies that is an L-sized vector. 
Consequently, each SBS has an implicit upper bound (L) on 
the number of input symbols that can be precoded by Ej. 
This, together with the perfect CSIT assumption, is the cost 
of the cross-tier interference cancelation constraint induced 
by the overlay cognitive approach. This guarantees absence 
of cross-tier interference at the MUEs if perfect CSIT is 
available at the SBSs, differently from the underlay approach 
adopted by the cognitive beamforming solutions discussed in 
Sec. U We note that, unlike other interference management 
schemes that exploit the spatial degrees of freedom by the 
use of multiple antennas, i.e. zero forcing (ZF), and/or special 
decoding strategies at the receiver, i.e., IA, the proposed 
technique requires only one antenna per SBS and MUE and 
legacy OFDM decoding at the latter. In fact, the interference 
towards the primary system is canceled by adopting a precoder 
Ej that opportunistically exploits the redundancy introduced 
by the MBS to combat inter symbol (ISI) and inter block 
interference (IBI), e.g. the cyclic prefix. In the following, 
we start from these findings to analyze the multi SBS/SUE 
scenario described in Sec. [TT] 

Finally, we note that, the complexity of the LQ 
decomposition of an N x (N + L) matrix, e.g Hsm'', is 
0(N(N + L) 2 -(N + L)N 2 + (N + L) 3 ) J23]. Consequently, 
a centralized approach to find the null space of 
the aggregated cross-tier interference channel matrix, i.e., H sm 
in (|7]i, would require an LQ decomposition of complexity 
0(N[K(N + L)} 2 - [K(N + L)]N 2 + [K(N + L)} 3 ), 
growing exponentially with K. Therefore, the distributed 
nature of the proposed E^ precoding not only reduces the 
backhaul signaling requirements, but dramatically decreases 
the complexity of the processing in the second tier, where K 
low complexity LQ decompositions are performed in parallel 
to derive the K individual precoders. 

B. Multi SBS/SUE VFDM Precoder Design 

As seen in Sec. IIII-AI the SBSs separately design the 
precoders E^, Vi £ [1, K], such that the overall precoder E as 
shown in (T7\ successfully satisfies (TBl i. As a consequence, 
we can rewrite the signal model in (fTZt and ([Lit as 

y m = H mm s m + n m (24) 
y s = H ss Es s + H ms s m + u s ., (25) 

with u s = [f s , • • • j i / s '' 1 ' T ] T - We focus on the second tier 
and, for clarity, we simplify the notation by introducing 

H ss = H SS E g c KNxKL . (26) 

The structure of the received signal is the same for any SUE, 
hence we can rewrite (l23l l for the multi-user case as 

y« = FH(p)s« + b£ W s® + FU^s m + , (27) 

in which we identify a useful component, two interfering terms 
and the thermal noise. In (|27), H s ( f ] ' ft) s[* ] g (CNx.(K-i)L 
represents the co-tier interference experienced by each SUE. 
Clearly, the performance of the second tier hinges on the 
mutual interference between the SBSs and is strongly inter- 
ference limited as K increases. Note that, as in the single 



user case, the absence of cooperation between the two tiers 
implies that the MBS' interference on the SUEs is always 
present. Consequently, in this scenario, each SUE deals with 
a stronger interference if compared to the single SBS case in 
Sec. IIII-AI To address this issue we assume that the SBSs 
may communicate over an infinite-capacity backhaul realizing 
a coordinated network MIMO system. Despite being hardly 
realistic, this assumption is usually made in similar scenarios 
for first studies on newly-proposed algorithms, to put focus 
on ultimate bounds of such solutions and achieve a better 
understanding of their potential |[T7| . 11241 . The cooperating 
SBSs can be therefore modeled as a MIMO broadcast channel 
(MIMO-BC), whose capacity is given by DPC O, a difficult 
to implement technique. Because of its complexity, many 
suboptimal but linear strategies have been introduced lately. 
Accordingly, we propose to address the co-tier interference 
problem at the cooperating SBSs by adding one linear sub- 
optimal precoding layer, resulting in an overall cascaded 
MU-VFDM precoder, as detailed in the following sections. 

C. Dimensionality Problem and Linear Techniques 

Having solved the cross-tier interference problem, now we 
devote our attention to mitigating the co-tier interference by 
means of a linear suboptimal precoder. As such, in (l25i . 
we focus on the SBSs' transmission by isolating the term 
H ss of dimension KN x KL, as defined in (1261 ). Note 
that, in any block transmission system, the added redundancy 
L to the block of N useful symbols is always such that 
-4 < 1, for matters of efficiency. As seen in Sec. IIII-AI 
MU-VFDM imposes a dimensionality constraint to the trans- 
mitters in the second-tier since each SBS precodes up to L in- 
put symbols while each SUE receives TV symbols. This implies 
that a direct application of techniques such as zero forcing 
beamforming (ZFBF) 11261 or block diagonalization (BD) 11271 
is not possible, since both require that the number of columns 
(transmit dimensions) of the channel matrix be bigger or equal 
than the number of rows (receive dimensions). Regularized 
inverse beamforming (RIBF) [28] is applicable, but it achieves 
poor performance at high SNR, due to the aforementioned 
dimensionality issue (N > L received symbols) that yields 
a very poor condition number to the equivalent channel rep- 
resentation built upon Toeplitz matrices. Matched filter (MF) 
precoding [29] performs similarly, being largely suboptimal 
at high SNR. It is known from 11301 . and for the multiple 
beams case from |3"TI . that opportunistic random beamform- 
ing (ORBF) based techniques are able to yield the optimal 
capacity scaling of KL log log KN in dense networks with a 
large number of receivers. Unfortunately, in our scenario the 
ratio j; is such that we can not achieve good performance 
using these techniques. In general, most of the results in the 
literature regarding linear precoding techniques under given 
optimization criteria assume only one antenna/symbol at the 
receiver. For this reason, a direct extension of these techniques 
is not possible. 

Algorithms like iterative regularized block diagonalization 
(IRBD) l32| deal with multiple symbols/antennas at each re- 
ceiver. The higher experienced diversity gain is due to the sup- 
pression of the interference only between the symbols received 
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by two different receivers. These algorithms perform better 
than other techniques that rely on the single antenna/symbol 
assumption. On the other hand, they require a joint receiver 
decoding, with a consequent increase in the complexity of the 
receivers' architecture. 

Simpler solutions, implemented to deal with an arbitrary 
number of dimensions at each receiver, are user/antenna se- 
lection based algorithms. It is known that by scheduling only 
a subset of antennas or eigenmodes |[27| to be served using 
a classical ZFBF, the achievable sum-rate is asymptotically 
optimal 11331 . In spite of this, the condition for the asymptotic 
optimality is never met in a MU-VFDM system, thus neither 
an exhaustive search of the optimal subset nor a faster and 
suboptimal greedy selection algorithm [|34] can achieve good 
results. 

Looking at the schemes presented thus far, we note that 
the inherent dimensionality constraint limits the performance 
of the second tier, in terms of both achievable sum-rate and 
complexity of the SBSs/SUEs. Starting from this considera- 
tion, we propose a low complexity solution to overcome the 
dimensionality constraint and manage multi-user interference 
in the following section. 

D. RIBF Flexible Network Solution 

Consider a flexible approach to the second-tier deployment 
in which the network designer can modify the dimensionality 
of the system by installing more antennas at each SBS/SUE, 
or alternatively by increasing the SBS' density. We let j tx , ^ rx 
be two parameters such that jtxL,"f r xN <E N are the number 
of transmit and receive dimensions respectively, with L and 
N fixed due to the OFDMA symbol structure. This way, the 
network designer can tune jtx and ^ rx to capitalize on the 
flexibility of the model, effectively changing the number of 
available channels for the transmission, and obtaining different 
operating scenarios. For instance, when j t x = 1 and j rx grows 
large, the system experiences a large increase of the number of 
receive dimensions, i.e., implying a greater number of SUEs 
(or SUEs' antennas) from which the best ones to serve are 
selected, and this represents the condition under which ORBF 
is optimal (a very "tall" overall channel matrix). Conversely, 
if 7„- is kept constant (~f rx = 1 for simplicity) and we let 
j tx increase, the SBSs can exploit the abundance of transmit 
dimensions to achieve a higher transmit diversity, thanks to the 
greater number of considered channels. Another interesting 
configuration is given by 7f X = N and ~f rx = L, that is a 
network where the number of transmit and receive dimensions 
coincides, i.e., channel inversion based techniques such as 
ZFBF and RIBF become efficient in terms of degrees of 
freedom exploitation. These strategies do not require iterative 
or greedy algorithms to be implemented, and thus, represent 
an attractive solution to manage the co-tier interference by 
means of a one-shot technique. In particular, it is known from 
11281 that RIBF offers better performance for a wider class of 
channels, regularizing the matrix to be inverted whenever its 
condition number is poor. Consequently, in the following we 
will focus on RIBF, and we note that it can be implemented 
effectively in the considered scenario if the dimensionality 



constraint is overcome, thus if the following holds 

ItxL > -f rx N. (28) 

Then, without loss of generality, we let j rx = 1 and j t x 
increase. In particular, we note that this preserves the legacy 
number of antennas per SUE, i.e., 1, and their disjoint 
decoding strategy. Due to the large number of SBSs (or 
antennas per SBS), we consider a uniform power allocation 
strategy to reduce the computational burden for the SBSs. We 
remark that, thanks to the ^ rx and ~f tx tuning, the second 
tier is characterized by a greater number of channels. As a 
consequence, in the new setup, s s is a vector of size j t xKL, 

E G £lt*K(N+L)y. ltI KL and jj £ £ KNx lt!C KL _ At (Ms 

stage, we can define 

2 

* = nl&iKN + "Ml (29) 

' s 

as the joint RIBF precoder, with * € £-/^klxKN_ Then ^ if 
we let u s e C K Nxl be a new aggregated SBSs' input symbol 
vector, such that s s = *&u s we can rewrite the signal model 
given by (l24l and (l25T l as 

y m = H mm s m + n m (30) 

y s = H ss Wu s + H ras x m + i/ s , (31) 

where 

w = E$ g cltJ: K(N+L)xKN (32) 

y / tr(E** H E H ) 

is the overall normalized MU-VFDM cascaded precoder, such 
that tr(W H W) = 1. We emphasize that, the cascaded precoder 
structure is intrinsically different from that of our previous 
work in l22l . even for the K,M = 1 case. In fact, the 
use of an outer linear precoding scheme, while preserving 
the interference cancelation condition towards the first tier, 
substantially changes the dimensionality of the system. 

IV. Numerical Analysis 

In this section, we present a numerical performance analysis 
of the proposed technique. Please note that, according to 
Sec. |II]and|inj the matrices H sm and E are not composed of 
i.i.d. random entries, but are strongly structured. No closed 
form of the eigenvalue/eigenvector distribution is available, 
and a purely theoretical performance analysis can not 
be carried out. Consequently, we proceed by means of 
Monte Carlo based simulations of the considered 
downlink scenario, comprised of an OFDMA/LTE MBS 
in the macro-cell with M = 4 MUEs, and an MU-VFDM 
based small-cell system. For simplicity, we consider the least 
resource-demanding extended mode proposed by the standard 
02), and characterized by N = 128 subcarriers, a cyclic 
prefix of length L = 32, for a total bandwidth of 1.92 MHz. 
Noise and channel vectors are generated as described in 
Sec. mi First we assume that perfect CSI is available at 
the SBSs, afterwards we admit for the presence of noisy 
channel estimations yielding imperfect CSIT Note that, if 
not stated otherwise in the text, we do not consider any 
interference from the MBS to the SUEs to isolate the effect 
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of the MU-VFDM precoder on second tier's performance. 
In particular, this assumption is crucial to evaluate the effect 
of the imperfect channel estimation at the SBSs on the 
performance of the cascaded precoder designed in Sec. [Ill] 
Finally, for the sake of compactness of the notation in our 
plots, we introduce the load rate f3 as the ratio between the 
number of transmit and receive dimensions as defined in 
Sec. I1I1-D1 and given by 



IrxN' 



(33) 



A. Multi-User VFDM 

Consider a small-cell system composed by K = 3 
SBSs/SUEs. Let us assume that the SBSs null the interference 
towards the MUEs by ( |2TT i. then the upper bound capacity 
Cp™ achieved by adopting DPC, for a uniform power allo- 
cation is as follows 



1 DPC — 



N + L 



-3 



loe 



i-KN 



N + L 



(34) 

where B is the considered bandwidth and P m and P s = -Sp- are 
the power per transmit symbol at the MBS and at each SBS, 
respectively. Note that, the adopted model implies that the 
total transmit power per tier is the same, i.e., P m (N + L), and 
the larger K becomes, the lower the power budget available at 
each SBS. This is imposed to model the second tier in compli- 
ance with the lower energy consumption requirements that the 
SBSs will likely have w.r.t. a legacy MBS in 4G networks Q. 
In Fig. [3j Cp™ is compared to the achievable ergodic sum- 
rate C SVM of MU-VFDM where the <& stage precoding is ob- 
tained by some of the linear precoding strategies presented in 
Sec. IIII-CI for SNR e [0,30], including the semi-orthogonal 
user selection ZFBF (SUS-ZFBF) algorithm proposed in 11331 . 
The behavior of the considered linear precoding schemes 
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Figure 3. Rate of the SBSs for different transmit schemes, 
(TV = 128, L = 32 and bandwidth of 1.92 Mhz). 



K 



shows a big rate offset when compared to the upper bound 
given by C^™, and this confirms what has been discussed in 
Sec. lITFCl 



To compute the sum-rate of the small-cell system imple- 
menting MU-VFDM with the RIBF outer precoder, C|^p, 
we need to evaluate the SINR for each of the KN received 
symbols at the SUEs. Let * = . . . , cj) {KN) ]. Let 



ssjjl, 



then we can write 



SINR fs 



[HJj^ifi] denote the j row of H s 



ih„ ( ?V i)|2 



E 



KN ,T-(j) ,(i),o 



trtww")^ 

P B K(N+L) 



Vj£[l,KN] 
(35) 

where the dimension of H ss depends strictly on the value 
assumed by (3. Then, it is straightforward to see that for a 
A'-SBS system the achievable sum-rate, when perfect CSIT is 
available, is given by 



^SUM 



B 



N 



KN 



log 2 (l + SINR (s)j ). 



(36) 



In Fig. |4] we illustrate a comparison between C^™ and 



Cgg* for a 



RIBF 

load rate of f3 = 3, confirming that 
the proposed technique has comparable performance to 
state-of-the-art solutions. In particular, due to the inherent sim- 
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SNR [dB] 

Figure 4. Achievable rate of the SBSs with the MU-VFDM (RIBF) flexible 
network solution, K = 3, = 2.5 (N = 128, L = 32 and bandwidth of 
1.92 Mhz). 

plicity and flexibility of the proposed solution, the SBSs' per- 
formance can be made arbitrarily close to the upper bound, by 
increasing the number of dimensions at the transmit side. We 
remark that, the complexity of the linear precoding techniques 
outperforming RIBF in Fig. [3] prevents their implementability 
for j3 > jr. This consideration further motivates the proposed 
solution for the multi-user VFDM dense network deployment. 

B. Imperfect CSIT 

In Sec. UnJ we showed that when perfect CSIT is avail- 
able at the SBSs, an interference nulling precoder E can 
be designed. However, in a realistic implementation, each 
transmitter in the system performs noisy channel estimations, 
yielding imperfect CSIT. Therefore, in this section, we seek 
for a deeper understanding of the impact of the CSIT ac- 
quisition on the overall network performance. We recall that 
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in Sec. IIII-BI we assumed an infinite-capacity backhaul. This 
allows us to target our efforts on the analysis of the effect 
of a noisy channel estimation onto the performance of the 
two-tiered network. The study of the achievable performance 
for a two-tiered network operating under limited backhaul 
capacity, and the impact of the quantization of the CSI, will 
be a subject of future work. The design of a suitable channel 
estimation procedure is out of the scope of this work as well, 
thus, for simplicity, we assume a classic training/transmission 
scheme as in l36l . 

Consider a block fading channel model where a channel 
estimation is valid throughout the duration of the coherence 
time T. The channel estimations are performed during a period 
t < T, hence the available time for transmission is upper 
bounded by T — r. During the training phase the devices 
broadcast orthonormal sequences of known pilot symbols of 
equal power. Each channel observation can be expressed as 

r = Vp^h + n, (37) 



where h is the channel vector, p is the transmit power and 
n ~ Cj\f(0, o^I( i+1 \) models the effect of the Gaussian 
noise at the devices' circuitry, driving the experienced SNR 
at the estimating device. Each device computes the minimum 
mean-square error (MMSE) estimate of h, by evaluating the 
observation r. At this stage, h can be expressed by means 
of two components, i.e., an estimate h and an independent 
Gaussian error h 11371 . that is 



h = h + h. 



(38) 



Without the perfect CSI assumption at the SBSs, the zero in- 
terference constraint in (Q3) can no longer be satisfied, thus the 
SBSs may generate interference towards the MUEs. If we de- 
note the j th row of H mm as = [[H^Jji, . . . , [H ram ] jA r], 
and the j th row of H sm as hffi = [H sm ]ji, . . . , [U sm ] J7 ^ KL ], 
then the SINR per received symbol at the MUEs reads 



SINR 



P Klh^l 2 

1 m 1 ^ 11 mm 



,Vj£[i,iV]. (39) 

Note that, the imperfect CSI at the SBSs has an impact on the 
general design of W, worsening the SINR per received symbol 
at the SUEs, due to channel estimation effects and increased 
co-tier interference component. Therefore, (l35i does not hold 
for this case and each SUE experiences an effective SINR 
value f36l per received symbol given by 

"<l> u '\ 3 



SINR (S ),, = 



ih (j V' ) p + tr<ww " 

l"ss V I ' P,K(N-\ 



l + (l + r)- 



ihj j V j) i 2 



(40) 



\K 



)^(0| 3 , tr(WW H )ir» 
Tr I + P S K(N + L) 



Vj £ [1, KN], where we assume that the same transmit power 
is used for training and data symbols. Then, the sum-rate of 
the MBS and SBSs respectively is 



c: 



SUM, I 



T 



N 



SUM, I 



T(N- 
T - 



E lo §2( 1 



SINR 



(m) ,3 



) (41) 



T(N + L 



3 = L 

KN 

■£ 



log 2 (l + SINR (s)j ). (42) 



To reduce Monte Carlo simulation times, we consider N = 64 
active subcarriers, cyclic prefix length of L = 16 and a load 
rate of (3 = 1. In Fig. [5] the ratio between the sum-rate 
obtained with imperfect CSIT and the sum-rate obtained with 
perfect CSIT is computed for the MBS and the SBSs as dif- 
ferent t/T proportions are chosen, for SNR £ {0, 10, 20} dB. 
Consider the MBS. We note that the optimal t hinges on the 



GO 

a o.8 



I 0.6 



0.4 





-©- SNR = dB 




-B-SNR= lOdB 




-±-SNR = 20dB 







0.1 



0.2 



0.3 



0.4 



0.5 



0.8 



0.6 



P4 0.4 
0.2 





SNR = dB 
-»-SNR= 10 dB 
-A-SNR = 20dB 









0.1 



0.2 



0.3 



0.4 



0.5 



Figure 5. Ratio between the rate obtained with imperfect CSIT and the rate 
obtained with perfect CSIT for MBS and SBSs as the SNR changes, f3 = 1 
and K = 3 (N = 64, L = 16 and bandwidth of 0.96 Mhz). 



SNR and, in particular, t = 0.1T is the optimal value in the 
low to medium SNR regime. On the other hand, the result 
for SNR= 20 dB shows that the pre-log factor dominates 
the sum-rate in this regime, and the best performance is 
obtained for the minimum value considered in the simulation, 
i.e., t = 0.05T. Interestingly, the rate loss experienced 
by the MBS for SNR= dB is around 22%. Thus, the 
cross-tier interference cancelation provided by MU-VFDM 
shows a promising robustness to imperfect CSI even if 
the experienced SNR is very low. Switching our focus to 
the second tier, we see that the impact of the channel 
estimation errors at the SBSs on the effectiveness of the 
co-tier interference mitigation is larger. As a result, the SBSs 
experience a non-negligible sum-rate loss for imperfect CSIT, 
especially at very low SNR. However, differently from what 
we have seen for the MBS, the optimal value for r does 
not show a clear dependence on the SNR, being consistently 
r = 0.2T throughout the considered SNR range. In particular, 
we note how the sum-rate loss varies slowly with r. This 
implies that small variations on the available time for the 
channel estimation w.r.t. the optimal r are acceptable by the 
SBSs, allowing for faster suboptimal channel estimations if 
necessary. 

To conclude the analysis on the impact of the imperfect 
CSIT on the performance of the two-tiered network, we 
test how MU-VFDM performs as the number of transmit 
dimensions in the second tier increases. There are two choices 
at hand: either we modify the ratio between the number of 
transmit and receive dimensions, i.e., j rx and K fixed and 
j3 increases, or simply deploy more SBS/SUE pairs, i.e., j rx 
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and p fixed and K increases. Therefore, we let the load rate 
P e {1, 2, 3} in Fig.[6](with ^ rx = 1, K = 3), and the number 
of SBSs K G {1,3,6} in Fig. (with 7ra; = 1, = 3). 
We assume a constant SNR= 10 dB. We first focus on the 
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Figure 6. Ratio between the rate obtained with imperfect CSIT and the rate 
obtained with perfect CSIT for MBS and SBSs as /3 changes, SNR = 10 dB 
and K = 3 (JV = 64, L = 16 and bandwidth of 0.96 Mhz). 




Figure 7. Ratio between the rate obtained with imperfect CSIT and the rate 
obtained with perfect CSIT for MBS and SBSs as K changes, SNR = 10 dB 
and = 1 (N = 64, L = 16 and bandwidth of 0.96 Mhz). 

MBS sum-rate loss. By comparing the two cases, MU-VFDM 
confirms its robustness to imperfect CSIT and effectiveness 
for what concerns the cross-tier interference cancelation, re- 
gardless of the adopted approach. In particular, we note that 
r = 0.1T is optimal for every tested configuration. The 
sum-rate loss of the SBSs shows a similar trend for the two 
considered approaches, despite the difference in the optimal 
value for t, i.e., r = 0.1T in Fig. [6] and t = 0.15T in Fig. 
[7] Nevertheless, we notice that the sum-rate loss for the SBSs 
increases as K increases, but remarkably shows a decreasing 
behavior as j3 increases. This interesting result is due to the 
higher transmit diversity gain experienced by the SBSs as 
P increases. If the number of transmit dimensions is largely 
greater than the number of receive dimensions, the diversity 



gain can compensate the rate loss due to the reduced co-tier 
interference mitigation provided by MU-VFDM in the second- 
tier for imperfect CSIT, showing the potential of a densely 
deployed second-tier adopting the proposed technique. 

C. Comparison with existing solutions 

In this section, we evaluate the performance of the pro- 
posed scheme by comparing MU-VFDM to state-of-the-art 
approaches that allow the deployment of a two-tiered net- 
work (3) : 1) complete sharing, 2) partial sharing, 3) com- 
plete separation. MU-VFDM allows the coexistence of SBSs 
and MBS inside the same area, canceling the interference 
from the former to the latter, adopting a complete sharing 
approach. Among the aforementioned bandwidth management 
schemes, only the complete separation approach guarantees 
zero interference from the SBSs to the MUEs. Therefore, for 
a fair comparison, we focus on this approach and divide the 
available bandwidth in two portions assigned exclusively to 
the MBS and the SBSs. Considering the values introduced 
previously, i.e., N = 64 and L = 16, this implies that both the 
MU-VFDM and the complete separation based system trans- 
mit over a bandwidth B = 0.96 MHz. As seen in Sec. IIII-AI 
by implementing MU-VFDM, each SBS can transmit up to L 
input symbols from each SBS' antenna. On the other hand, 
the MBS transmits N input symbols, i.e., the number of con- 
sidered subcarriers. Consequently, in the complete separation 
approach, we assign a bandwidth B s = 2± to the SBSs and 
B m = B — B s to the MBS. By means of this division, we 
ensure that each SBS' antenna is transmitting the same number 
of symbols as in MU-VFDM. Moreover, in order to exploit all 
the available transmit dimensions, we assume that the SBSs 
perform a network MIMO-OFDMA transmission towards the 
SUEs, adopting a ZF precoding such that no linear processing 
at the SUEs is required, as in MU-VFDM. Note that, a legacy 
OFDMA transmission is performed by the MBS as described 
previously. As a last remark, differently from what we have 
assumed so far, we assume that in MU-VFDM the SUEs suffer 
from full interference from the MBS. This allows for a more 
realistic and fair comparison, accounting both for advantages 
and drawbacks of the two different bandwidth management 
approaches. We let P = 3 and K = 6. In Fig. [8] the achievable 
rate of the two schemes for perfect CSIT is presented. The 
complete sharing approach implemented through MU-VFDM 
shows a clear advantage over the complete separation scheme, 
at it provides a larger overall sum-rate of the two-tiered 
network at all SNR regimes. This is achieved despite the 
large impact of the cross-tier interference from the MBS 
to the SUEs, clearly noticeable at medium and high SNR 
values. This remarkable result motivates a further comparison, 
when only imperfect CSIT is available. In Fig. [9] we see 
that MU-VFDM achieves a slightly worse performance if 
compared to the previous case, even if the overall sum-rate 
of the two-tiered network is still higher than the performance 
of complete separation scheme, for SNR values greater than 
7 dB. Due to the nature of the cascaded precoder a wrong 
channel estimation deteriorates the performance especially 
for low SNR values. On the other hand, the advantage for 
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ment of SBSs inside the coverage area of a preexist- 
ing MBS. We focused on the coexistence over the same 
bandwidth between a downlink LTE MBS and an SBS 
system, to illustrate both the feasibility and the perfor- 
mance of MU-VFDM. The network MIMO assumption 
made a potentially interference limited system become a 
MIMO-BC. Thanks to this fact, several linear precoding tech- 
niques involving cooperation between transmitters have been 
taken into account, and the inherent dimensionality constraint 
due to the structure of the precoder E has been identified. 
The search for a suitable scheme brought us to the proposed 
flexible RIBF based approach presented in Sec. IIII-DI In- 
creasing the number of transmit dimensions, while keeping 
the receiver layout, is a viable way to design a system that 
overcomes the dimensionality problem and achieves relevant 
performance in terms of sum-rate. Such a system design can 
be realized either by extra antenna installation, denser SBS 
deployment or a flexible combination of both. The relaxation 
of the perfect CSIT assumption at the SBSs results in rate loss 
experienced by both systems, due to the imperfectly devised 
precoder. The best compromise between training and data 
symbols has been investigated, for various SNR values, as well 
as the best performing strategy to deploy a dense network 
for the imperfect CSIT case. Finally, a comparison with 
state-of-the-art techniques has shown a consistent advantage 
of the proposed technique for a large range of SNR values, 
both for perfect and imperfect CSIT case. The results pre- 
sented herein reinforce our previous findings and confirm that 
MU-VFDM can be used to deploy SBSs and MBS coexisting 
inside the same coverage area, sharing the same band. 

The analysis of the performance of this scheme under 
limited backhaul capacity assumption is matter of our future 
research, along with the impact of a partial cooperation 
between the SBSs. Moreover, we will move from a fully 
coordinated to a clustered network MIMO scenario, to find dif- 
ferent and more practically implementable ways to manage the 
co-tier interference while guaranteeing the cross-tier interfer- 
ence cancelation. 



Figure 9. Achievable rate of a two-tiered network, 
cross-tier interference MBS -> SUEs (TV = 64, L = 
0.96 Mhz). Imperfect CSIT. 



K = 6, /3 = 3 and 
16 and bandwidth of 



other SNR regimes is evident and promising, despite the high 
impact that, as before, the MBS' cross-tier interference has 
on the performance of the second-tier for medium and high 
SNR. Therefore, MU-VFDM is able to exploit efficiently the 
higher multiplexing gain provided by the complete sharing 
approach, at the expense of a slightly worse performance for 
low SNR if compared to the complete separation strategy, for 
the imperfect CSIT case. Consequently, the coexistence in the 
two-tiered network can be achieved, effectively enhancing the 
spectral efficiency and the capacity per area, for both CSIT 
assumptions. 

V. Conclusion 

In this work, we presented a DSA cognitive over- 
lay technique, called MU-VFDM, that allows the deploy- 
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